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We present a search for the standard model Higgs boson in final states with an electron or muon 
and a hadronically decaying tau lepton in association with two or more jets using 9.7 fb -1 of Run II 
Fermilab Tevatron Collider data collected with the DO detector. The analysis is sensitive to Higgs 
boson production via gluon fusion, associated vector boson production, and vector boson fusion, 
followed by the Higgs boson decay to tau lepton pairs or to W boson pairs. The ratios of 95% C.L. 
upper limits on the cross section times branching ratio to those predicted by the standard model 
are obtained for orthogonal subsamples that are enriched in either H —¥ rr decays or H — > WW 
decays, and for the combination of these subsample limits. The observed and expected limit ratios 
for the combined subsamples at a Higgs boson mass of 125 GeV are 11.3 and 9.0 respectively. 

PACS numbers: 13.85.Rm, 14.80Bn 



I. INTRODUCTION 

The standard model (SM) of particle physics postu- 
lates a complex Higgs doublet field as the source of 
electroweak symmetry breaking, giving rise to non-zero 
masses of the W and Z vector bosons and the funda- 
mental fermions. The mass of the spin-zero Higgs boson, 
H, that survives after the symmetry breaking is not pre- 
dicted by the SM, but is constrained by direct searches at 
the LEP Q, Tevatron and LHC % H colliders, and 
by precision electroweak measurements [5| to be in the 
range 122 - 127 GeV at the 95% C.L. The boson observed 
at a mass of about 126 GeV by ATLAS and CMS 0, Q 
when combining evidence for a narrow resonance in the 
77 and ZZ channels has production and decay properties 
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that are consistent with the SM Higgs boson, given the 
current sensitivities. Evidence for a particle compatible 
with the discovered boson with the decay bb has been 
reported independently by the Tevatron experiments 0- 
8]. It is now important to measure its couplings for all 
accessible particles, in particular to leptons for which no 
evidence currently exists. 

In this paper we present a search for the Higgs boson 
in final states that are sensitive to both the H — » tt 
and H — > WW decay modes containing a hadronically 
decaying tau lepton, a muon or electron plus at least 
two jets. We use data collected with the DO detector 
at the Fermilab proton-antiproton Tevatron Collider at 
\/s = 1-96 TeV. The search is conducted at ten Higgs 
boson masses (Mh) between 105 and 150 GeV in 5 GeV 
intervals. In the following represents a hadronically 
decaying tau lepton, I denotes a lepton (fi or e) and j 
represents hadronic jets. The symbols ll /J.Tjf and "erjj" 
denote the two individual search channels, collectively 
described as "^rjj" . 

Over the mass range 115 < Mh < 150 GeV the Higgs 
boson decay branching fractions vary considerably, with 
H — » bb being the dominant decay for Mh Ss 135 GeV 
and H — > W + W~ becoming important for Mh <; 135 
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GeV. Below M H < 125 GeV, H -> t+t~ has an appre- 
ciable (s» 8%) branching fraction. Previous analyses by 
the DO and CDF Collaborations have mainly focused on 
the decay modes H — » bb in the low mass region and 
H — > WW with both W bosons decaying to a lepton and 
neutrino in the high mass region 0. A DO publication @ 
reported a Higgs boson search in the r^/x final state with 
zero or one jet, and the fJ.rjj and erjj final states, us- 
ing 7.3, 6.2 and 4.3 ftr 1 of data respectively. The CDF 
collaboration has published a search in the Th& + > 1 
jet final state using 6.0 ftr 1 of data [l(J. Here we re- 
port updated results for both the fATjj and erjj searches 
with the full DO 9.7 ftr 1 2002 - 2011 data set with an 
improved analysis using an extended set of variables for 
discriminating signal and background and an improved 
multivariate analysis. The current £rjj analyses super- 
sede those of Ref . @ . 

The Higgs boson production processes considered are 
(i) gluon fusion (GF), gg — » H (+ jets); («) vector boson 
fusion (VBF), qq — > qqH; (Hi) associated vector boson 
and Higgs boson production (VH), qq — ► VH, where V 
is a W or Z boson, and V — > qq; and (iv) associated 
Higgs boson and Z boson production (HZ), qq —> HZ, 
with H -> bb and Z -> tt. The GF, VBF, and VH 
processes are further subdivided according to the Higgs 
boson decay, H — » tt or H — > WW, and these subchan- 
nels are denoted as GF rT , GFww, etc. 

Tau leptons can occur at lower Mjj through direct de- 
cays of the Higgs boson or, at higher Mh, indirectly from 
H — » VV with V — > t + X. The leptons may arise from 
t decay or, at high Mh, directly from V decay. Thus 
the irjj channels are more uniformly sensitive to Higgs 
boson production over the full mass range investigated 
than are the dedicated H — > bb or H — >• WW — > Itvv 
analyses, thus improving the combined search sensitivity 
particularly in the intermediate mass region around 135 
GeV. 



II. THE DO DETECTOR 

The DO detector [lll - fl3T ] contains tracking, calorime- 
ter and muon subdetector systems. Silicon microstrip 
tracking detectors (SMT) near the interaction point cover 
pseudorapidity \r\\ < 3 to provide tracking and vertexing 
information. The SMT [lj, [lj| contains cylindrical bar- 
rel layers aligned with their axes parallel to the beams 
and disk segments. The disks are perpendicular to the 
beam axis, interleaved with, and extending beyond, the 
barrels. The central fiber tracker (CFT) surrounds the 
SMT, providing coverage to about |ry| =2. The CFT has 
eight concentric cylindrical layers of overlapped scintillat- 
ing fibers providing axial and stereo (±3°) measurements. 
A 1.9 T solenoid surrounds these tracking detectors. 

Three uranium liquid-argon calorimeters measure par- 
ticle energies. The central calorimeter (CC) covers \r/\ < 
1, and two end calorimeters (EC) extend coverage to 
about \rj\ — 4. The calorimeter is highly segmented 



along the particle direction, with four electromagnetic 
(EM) and four or five hadronic sections in depth, and 
segmentation transverse to the particle direction with 
typically Ar/ = Acj) = 0.1, where <f> is the azimuthal 
angle (Arj = A<f> — 0.05 in the third EM depth seg- 
ment). The calorimeters are supplemented with central 
and forward scintillating strip preshower detectors (CPS 
and FPS) located in front of the CC and EC. Intercryo- 
stat detectors (ICD) provide added sampling in the re- 
gion 1.1 < \r)\ < 1.4 where the CC and EC cryostat walls 
degrade the calorimeter energy resolution. 

Muons are measured just outside the calorimeters, and 
twice more outside 1.8 T iron toroidal magnets, over the 
range \r/\ < 2. Each measurement is based on scintil- 
lation counters and several layers of tracking chambers. 
Scintillators surrounding the exiting beams allow deter- 
mination of the luminosity [lj|. 

A three level trigger system selects events for data log- 
ging at about 100 Hz. The first level trigger (LI) is based 
on fast custom logic for several subdetectors and is ca- 
pable of making decisions after each beam crossing. The 
second level trigger (L2) makes microprocessor-based de- 
cisions using multi-detector information. The third level 
trigger (L3) uses fully digitized outputs from all detec- 
tors to refine the decision and select events for offline 
processing. 

The data collected for this analysis comes from the 
full Tevatron Run II period extending from 2002 to 2011. 
The data set is divided into two epochs, Runs Ha and lib. 
Between these two epochs, substantial upgrades were 
made to the detector, including the addition of a new 
radiation-hard silicon strip detector close to the beam 
line and upgrades to the trigger system. The instanta- 
neous luminosity of the Tevatron increased substantially 
between Run Ha and Run lib. The integrated luminosi- 
ties are 1.0 and 8.7 ft -1 for the two epochs respectively. 



III. TRIGGER 

The /J,rjj data were collected using all triggers em- 
ployed in DO. We reject events in which a muon candidate 
points toward the region of impaired coverage due to the 
detector supports. The trigger efficiency was determined 
in two steps. In the first step the efficiency for a suite of 
single muon triggers was measured using a tag and probe 
analysis of a sample of Z — > /i/i events and found to be 
about 65% for that sample. In Run Ha, the single muon 
triggers require the muon transverse momentum pj, > 12 
GeV and \rj%\ < 2.0, where rfa is the pseudorapidity cal- 
culated assuming the muon originated at the center of 
the detector. Owing to the higher instanteous luminos- 
ity in Run lib, the trigger requirements were tightened to 
p^ > 15 GeV and \r]%\ < 1.6. The single muon trigger ef- 
ficiency is parameterized as a function of the rjd and <j) of 
the muon and instantaneous luminosity (as well as p^ in 
Run Ha). The background and signal events simulated 
by Monte Carlo (MC) are weighted by these efficiency 
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functions. 

In the second step, we measure the ratio, lZ a \\, of the 
£tj j signal sample data events collected with all triggers 
to those with single muon triggers, after subtracting the 
expected multijet component from both. (The signal and 
multijet samples are discussed in Sections PVTI and IVlTl ) 
We examine the dependences of this ratio upon the pr 
and X] of the fi, Th and leading (highest pr) jet. No sig- 
nificant dependences are observed, and a constant TZ a u is 
used as an additional weighting factor for the efficiency 
of the MC samples. The use of this inclusive trigger ap- 
proach gives an increase in the data sample of about 40% 
compared to that from the single muon triggers alone. 

For the erjj analysis, we employ a set of triggers that 
require an EM object and a jet. The efficiency of the elec- 
tron components of these triggers is obtained from a tag 
and probe analysis of Z — > ee events and is parametrized 
in terms of electron px and r\d- The efficiency of the 
jet component is measured in events selected by a single 
muon trigger in which a jet is reconstructed offline; the 
jet trigger term efficiency is then determined as a func- 
tion of jet pt and r)d on the basis of whether or not the 
corresponding muon plus jet trigger condition is satisfied. 
The impact of the correlation between electron and jet 
portions of the trigger is small. The trigger efficiency is 
about 85% for the signal processes. 



IV. BACKGROUND AND SIGNAL SAMPLES 

The major backgrounds for the Higgs boson search are 
Z and W bosons produced in association with jets, tt, 
and QCD multijet production (MJ) in which a jet sim- 
ulates a lepton or hadronically decaying tau. Smaller 
backgrounds arise from boson (W, Z or 7) pair produc- 
tion ( "diboson" ) and single top quark production which 
is included with the tt background. All but the MJ back- 
ground arc simulated using MC event generator programs 
and normalized to the highest order theoretical calcula- 
tions available. These are referred to below as "SM" 
backgrounds. The MC simulations use the CTEQ6LI 
parton distribution functions (PDF) [17] . 

The Z + jets and W + jets MC event samples are gen- 
erated by ALPGEN [3, interfaced to PYTHIA [lji[ to pro- 
vide initial and final state radiation and the hadroniza- 
tion of the produced partons. The p^ distribution is 
reweighted to agree with the DO measurement [20| . The 
Pt is also reweighted using the same experimental in- 
put, corrected for the theoretical differences between W 
and Z bosons expected in next-to-next-to-leading order 
(NNLO) QCD [ly. The Z + jets and W + jets cross sec- 
tions are normalized using the calculations of Ref. [22[ 
and the MSTW 2008 NNLO PDFs [Hj]. 

We simulate tt and single top quark events using the 
ALPGEN and SINGLETOP 



24( generators respectively, 



with the parton hadronization provided by PYTHIA. The 
normalizations are based on approximate NNLO QCD 
calculations (25|. The diboson events are generated by 



PYTHIA and normalized with MCFM [26j . 

Higgs boson production is simulated using PYTHIA, 
with normalizations taken from Ref. Q. The Higgs bo- 
son decays are simulated using HDECAY (27[, and the r 
decays are obtained from TAUOLA [2Sj . 

The MC signal and background events are passed 
through the GEANT3-based [29| simulation of the de- 
tector response. Prior to reconstructing the MC events 
with the standard data programs, we superimpose events 
from a library of data events collected from random beam 
crossings to account for detector noise and pileup from 
additional pp collisions in the same or previous bunch 
crossings. The difference between the luminosity distri- 
bution for the random events and our data sample is 
encoded in a weight factor applied to the MC events. 
Simulated events are also weighted to account for the 
differences between MC and data for the lepton, tau, 
and jet identification efficiencies and for the energy scale 
and resolution of jets, in addition to the trigger weights 
discussed in Section [TTTT 



V. OBJECT SELECTION CRITERIA 

Muon candidates are required to have hits in the muon 
chambers before and after the toroidal magnets, and to 
be matched to a track in the tracking system. Muons 
must be isolated from additional energy deposits in both 
the calorimeter and the tracking system. We require the 
calorimeter transverse energy, E^°, in the annular cone 
0.1 < R < 0.4 around the muon to be less than 2.5 GeV, 
where R = \J Ar; 2 + A0 2 , and require that the sum of 
the transverse momenta of tracks within a cone R < 0.5, 
excluding that of the candidate muon, be less than 2.5 
GeV. We reject cosmic ray induced muon candidates by 
requiring that a time of arrival at the muon system scin- 
tillation counters is within 10 ns of that expected for colli- 
sion products. The muon isolation selections are reversed 
for a special MJ control sample discussed in Section fVIII 

Electrons are identified using information from the EM 
and hadronic calorimeters, tracking detectors and the 
preshower detectors to form a combined electron iden- 
tification variable, £. The main component of £ is a like- 
lihood variable, C e , defined using eight individual vari- 
ables: the x 2 f° r the transverse and longitudinal shower 
profile to conform to that expected for an EM shower; the 
fraction of calorimeter energy observed in the EM layers; 
the number of CPS strips hit; the \ 2 01 the track match 
to the calorimeter cluster centroid; the ratio of track mo- 
mentum and calorimeter cluster energy; the number of 
tracks in a cone R < 0.05 around the electron; the sum 
of the transverse momenta of tracks within R < 0.4 of the 
candidate track; and the distance of closest approach of 
the track to the primary vertex (PV). The rms width of 
the calorimeter cluster and the isolation of the calorime- 
ter cluster from nearby energy also contribute to the de- 
termination of (. We require ( to exceed a threshold that 
is different for electrons in the CC and EC. The identi- 
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fication efficiencies are parametrized in terms of p-r, r\d 
and 4>. At px — 25 GeV, the efficiencies for CC electrons 
are about 83% and are about 50% for EC electrons, as 
measured in Z — > ee events. In addition, electron can- 
didate tracks in the CC region are required to impinge 
upon a calorimeter module within the central 80% of its 
azimuthal range. The C e variable requirement is modi- 
fied for the MJ control sample described in Section IVlII 

We select three types of hadronically decaying tau lep- 
tons based on the number of tracks within a cone R < 0.3 
and the number of EM sub-clusters found in the calorime- 
ter using a nearest neighbor algorithm. Type 1 requires 
one track and no EM subclusters. Type 2 requires one 
track and at least one EM subcluster. Type 3 requires at 
least two tracks with or without EM subclusters. Type 
3 candidates with exactly two tracks of opposite charge 
sign, for which the tau charge is ambiguous, are rejected. 
The visible t% transverse energy, Ej, , is constructed from 
the track momenta, augmented with EM calorimeter in- 
formation. We require the sum of the track momenta 
associated with the th (p?) to exceed (7, 5, 7) GeV, 
E T T > (12.5, 12.5, 15) GeV, and (j> T T jE T T ) > (0.65, 0.5, 0.5) 
for tau types (1, 2, 3) respectively, and the highest pr 
track for type 3 taus must exceed 5 GeV. We construct 
a neural network, NN r [3(3], based on energy deposi- 
tion patterns and isolation criteria in the calorimeter 
and tracking systems for each tau type to discriminate 
a tau from a misidentified jet. Figure [T] shows illustra- 
tive NN T distributions taken from a control data sample 
of Z — V tt candidates. We require NN T to be greater 
than (0.92, 0.90, 0.91) for tau types (1, 2, 3) respectively. 
For type 2 taus, we construct a second neural network, 
NN r / e , to differentiate taus from electrons. The distri- 
bution of NN r / e , taken from the erjj sample described 
in Section I VII is shown in Fig. [TJd) . We select events 
with NN r/e > 0.5. 

Jets are reconstructed using an iterative midpoint cone 
algorithm [3l| with a cone size R = 0.5. We require at 
least two tracks associated with the jet that point to the 
PV (vertex confirmation) in Run lib due to the higher 
multiplicity of collisions within a bunch crossing. Jet en- 
ergies are corrected to the particle level for out-of-cone 
showering, underlying event energy deposits and pileup 
from neighboring beam crossings, and for the effects of 
energy carried by muons and neutrinos when there is ev- 
idence for semileptonic decays of the jet particles. Jets in 
data are corrected for energy scale and resolution using 
7+jet and dijet samples. The MC jets are corrected for 
energy scale and resolution, as well as for the jet identi- 
fication efficiency to bring the MC responses into agree- 
ment with data. For the MC samples rich in quark jets 
(it and diboson), there is an additional calibration ap- 
plied to the jet energy that accounts for the differences 
between the responses of quark jets and the dominantly 
gluon jets for which the jet energy scale correction was 
obtained. 

The missing transverse energy, If}?, is computed from 
the observed transverse energy deposits in the calorime- 



ter and is adjusted for the appropriate energy scale cor- 
rections for all objects, for isolated muons that deposit 
less than their full energy in the calorimeter, and for 
the unclustered energy in the calorimeters not associated 
with jets or EM objects. We define a quantity S that 
measures the significance of the Jf,T to be different from 
zero, based on the measured resolutions of the compo- 
nents of the IpT calculation [32j . 

VI. EVENT SELECTION CRITERIA 

We select a sample of events ("signal sample") with 
the criteria given below. Some of these differ for the Run 
Ha and Run lib selection owing to the differences in the 
detector, triggers and luminosity, and the fact that the jet 
vertex confirmation was not applied in Run Ha, leading 
to differences in the modelling of jet related variables. 

For the /xrjj selection we require: 

• a muon as defined in Section [V] with pt^ > 12 (15) 
GeV and \rft\ < 2.0 (1.6), for Run Ha (Run lib); 

• at least one hadronic tau as defined in Section [V] 
with < 2. The tau candidate with the high- 
est pr is chosen for the analysis, and must have a 
charge sign that is opposite to the muon; 

• two jets with |?7^ ot | < 3.4; the leading jet ("jetl") 
is required to have p^* 1 > 20 GeV and a second- 
leading jet ("jet2") to have p^ t2 > 15 GeV; 

• no other electron with pf, > 10 GeV and no other 
muon with p^, > 10 GeV to retain orthogonality to 
other DO searches for the SM Higgs boson; 

• the scalar sum of all jet pt's in the event (Ht) must 
be greater than 80 GeV for Run Ha to improve the 
modelling of jet related variables. 

For the erjj selection we require: 

• an electron as defined in Section [V] with pj, > 15 
GeV, and \rj e \ < 1.1 or 1.5 < \r) e \ < 2.5; 

• at least one hadronic tau as defined in Section [V] 
with \rf^\ < 2. The tau with the highest pr is cho- 
sen for the analysis, and must have a charge sign 
that is opposite to the electron; 

• no type 1 taus with 1.05 < < 1.5, and type 2 
taus to have NN r / e > 0.5, to reduce the Z(ee)+jets 
background in which an electron is misidentified as 
a tau; 

• two jets with |?/ J d ct | < 3.4; p J ^" > 25 (20) GeV for 
Run Ha (Run lib) and p^ t2 > 15 GeV; 

• no other electron with p e T > 12.5 GeV and no muon 
with p^. > 12 GeV and \r/^\ < 2.0 for orthogonality 
to other DO SM Higgs boson searches; 
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FIG. 1: (color online) The NN T distribution for (a) type 1 taus, (b) type 2 taus, (c) type 3 taus, using data taken from a 
sample of Z — > tt events with no requirements on the number of associated jets, and (d) the NN T/ / e distribution for tau type 
2 from a sample of erjj events. 



• the TfiT significance variable is required to be S > 3 
(2) for Run Ha (Run lib) to reduce the MJ and 
Z — > ee backgrounds. 

Events with the same selection criteria except that the 
I and t>i have the same charge sign are retained for es- 
timating the MJ background in both the prjj and erjj 
analyses (SS signal samples). 

VII. MULTIJET BACKGROUND 

The MJ background arising from misidentification of 
leptons or taus by the detector reconstruction algorithms 
is difficult to simulate, so it is estimated using data. We 
define a sample of MJ-enriched events (MJ control sam- 
ple, At) which is large compared with the size of the sig- 
nal sample. We use the events in A4 , after subtraction of 
the small residual SM backgrounds simulated by MC, to 
provide the shapes of the MJ background kinematic dis- 
tributions. We also use the M. sample to obtain the MJ 
background normalization. For each tau type, the SM 
subtracted sample M. is divided into the opposite sign 
iTh (OS) and same sign £7% (SS) samples. The ratio of 
OS and SS events in M is used to scale the number of MJ 
events in the SS signal sample to obtain the normaliza- 
tion for the MJ background in the signal sample. The MJ 
yield in the SS signal sample is obtained after subtract- 
ing the small SM backgrounds. Denoting the number of 



events in A4 by M and the number of events in the SS 
or OS signal samples by N, the method is expressed by: 

<s J = P(^ss ata -^s S s M ) 

with 

p = (A4T - Mgr)/(^s d s ta " MO 1 ) . 

This background estimate is computed separately for 
each tau type and summed to give the total MJ back- 
ground. 

For the /J,rjj analysis, the MJ sample M. is obtained 
by reversing at least one of the muon isolation require- 
ments. The tau selection requirements are the same as 
for the signal sample. The MJ purity in this sample is 
about 97%. We observe no significant dependence of the 
p factors on the px or 77 of p, t>, or jets, and therefore 
take them to be constant. Their values are within about 
15% of unity and are similar for the three tau types, 
and in Run Ha and Run lib. We observe that the mod- 
elling of the shapes of variables which employ the jet pt's 
needs improvement and so we adopt a reweighting for the 
MJ events based upon the comparison of the distribu- 
tion shapes in the SS signal sample and the MJ control 
sample. We find that the simple reweighting function 
Ae~ BHT , fitted to the ratio of the Ht distributions for 
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the SS signal and MJ control samples, gives adequate 
modelling. 

For the erjj analysis, M. is obtained by requiring the 
tau selection 0.3 < NN r < 0.9, and by placing an upper 
bound of 0.85 on the C e in the electron identification. In 
forming M, no requirement is made on the I^t signifi- 
cance. S. For the Run lib data, the cut on S in the signal 
selection leaves a negligibly small MJ contribution in the 
SS signal sample after the SM background subtraction. 
Owing to the absence of jet vertex confirmation, some 
MJ background remains in Run Ha and the procedure 
used in the fxrjj analysis for its estimation is followed, 
except that no Ht reweighting is needed. For Run Ha, 
the MJ purity in M is 98% and the p values are about 
1.25. 



VIII. EVENT YIELDS 

The numbers of data and expected background events 
are given in Table U for the fiTjj and erjj analyses. Tak- 
ing into account the systematic uncertainties on the back- 
grounds (see Section [Xj) and uncertainties on the MC 
statistics, the predicted backgrounds are in agreement 
with the observed yields. The erjj yields are smaller 
than those for prjj primarily because of the requirements 
made for the erjj analysis on S, r) T , 4>e in the CC region, 
and NN r / e to reduce the MJ and Z^ee backgrounds. 
Representative expected signal yields for the nine pro- 
duction and decay processes are given in Table [TlJ 

TABLE I: For each analysis channel, the number of back- 
ground events expected from SM processes, MJ background, 
and observed in data, for individual and sum of all tau types 
after preselection, "type" denotes r type, "Vj" denotes W 
or Z + jets and "DB" denotes diboson processes. The un- 
certainty on the sum of all backgrounds includes both MC 
statistical and systematic uncertainties. 



type 


tt 


Wj Z u j 


Z TT j DB 


MJ 


E Bkd 


Data 


firjj analysis 


1 


15.3 


10.2 


4.4 


37.1 2.3 


39.1 


108.4±7.4 


119 


2 


121.3 


65.2 


29.3 


241.8 14.5 


135.4 


607.5±47.1 


684 


3 


20.2 


39.1 


4.4 


54.5 3.2 


50.6 


172.1±12.4 


187 


All 


156.9 


114.5 


38.1 


333.4 16.0 


225.1 


888.0±49.2 


990 


erjj analysis 


1 


4.5 


4.6 


0.0 


9.8 0.9 


3.1 


23.0±1.8 


15 


2 


57.7 


64.9 


66.6 


91.7 8.3 


1.7 


290.8±21.0 


261 


3 


27.2 


47.2 


2.4 


28.5 3.7 


14.6 


123.7±9.6 


124 


All 


89.4 


116.7 


69.1 


130.0 12.9 


19.4 


437.5±23.2 


400 



IX. MULTIVARIATE ANALYSIS 

The number of background events greatly exceeds the 
expected number of Higgs boson signal events so we 
employ multivariate techniques that discriminate signal 
from background by taking into account the correlations 



among the variables. The multivariate strategy for this 
analysis is complicated by the presence of many distinct 
signals of comparable size, each with its own character- 
istic kinematic properties. 

We tested several artificial learning techniques includ- 
ing neural networks and decision trees using the TMVA 
suite of programs [33] to implement the multivariate 
methods. We find an optimum performance with the 
gradient boosted decision tree classifier (BDT) which of- 
fers the advantage over neural networks that the use of 
variables that do not discriminate significantly between 
a particular signal and background, or are highly corre- 
lated with other variables, do not compromise the classi- 
fier performance. In the BDT [H], IH[ approach, a series 
of splittings of the event sample is made at a sequen- 
tial set of nodes into background-like and signal-like sub- 
sample nodes. The splitting is based upon the purity 
of signal and background events in a given node N and 
its signal-like and background-like daughter nodes S and 
B. Purity is defined as p = s/(s + b) where s (b) is the 
number of signal (background) events in the node. In 
the training, the event category is known either from the 
MC generation or MJ control samples. The optimum 
splitting is achieved by maximizing the decrease of the 
Gini index 36] i = 2p(l — p) = 2sb/(s + b) 2 going from 
the parent node N to the two daughter nodes S and B, 
considering all choices of input variable and cut thresh- 
olds for that variable. Each such subsample is subjected 
to further splitting until the sample sizes reach a pre- 
set value. At each node, a random sampling of events 
is chosen from the full sample to help mitigate the ef- 
fects of finite statistics. The training is recursive with 
misclassified events in one cycle being reweighted for the 
next cycle. The TMVA training is controlled by parame- 
ters such as the maximum number of trees, the degree of 
reweighting in successive cycles, the fraction of the full 
sample used at each node and the number of cycles al- 
lowed. We varied these parameters around their nominal 
settings to obtain optimum values for our analysis. 



A. BDT variables 

We examine a large set of potentially discriminating 
kinematic variables with which to train the multivariate 
analyses. We choose a subset of well-modelled variables 
for which the agreement of data with expected back- 
ground is good and which discriminate between at least 
one individual signal and background. In these distri- 
butions the expected signal contribution to the data is 
small. Table IIIII shows the variables used for both the 
M r ii an d erjj analyses. 

In Table IIIII Ht is the scalar sum of all jets with 
p T > 15 GeV, S T = H T + \p e T \ + \Prl and v t is the 
magnitude of the vector transverse momenta of £, Th and 
all jets. The variable^fr is the magnitude of the vectorial 
sum of all jet transverse momenta, and jTy is the magni- 
tude of the vector sum of all tracks emanating from the 
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TABLE II: For each analysis channel, the number of signal events expected for each of the nine production and decay processes, 
prior to the separation into the T and W subsamples discussed in the text. 



M H 


HZ 


ZH rT 


WH TT 


GF rT 


VBF Tr ZH^y^/ 




GFww 




Total 












firjj analysis 












105 


0.19 


0.47 


0.71 


0.66 


0.37 


0.03 


0.04 


0.10 


0.01 


2.58 


115 


0.15 


0.38 


0.57 


0.53 


0.34 


0.09 


0.15 


0.08 


0.05 


2.34 


125 


0.10 


0.27 


0.40 


0.45 


0.26 


0.19 


0.37 


0.18 


0.14 


2.35 


135 


0.06 


0.16 


0.23 


0.30 


0.18 


0.33 


0.63 


0.31 


0.24 


2.43 


145 


0.03 


0.08 


0.11 


0.17 


0.10 


0.47 


0.81 


0.59 


0.37 


2.72 


erjj analysis 


105 


0.14 


0.34 


0.53 


0.32 


0.26 


0.01 


0.01 


0.02 


0.00 


1.63 


115 


0.11 


0.31 


0.47 


0.32 


0.24 


0.04 


0.05 


0.01 


0.01 


1.56 


125 


0.07 


0.22 


0.33 


0.30 


0.21 


0.09 


0.14 


0.04 


0.04 


1.46 


135 


0.05 


0.15 


0.21 


0.21 


0.15 


0.17 


0.26 


0.11 


0.08 


1.38 


145 


0.02 


0.07 


0.11 


0.12 


0.08 


0.25 


0.35 


0.16 


0.11 


1.26 



TABLE III: Variables used for firjj and erjj analyses in BDT 
training. As indicated, three variables are used for only one 
analysis. 



transverse momentum variables 
~pr\t) 
Pt(t) 
Pt(j'i) 
Pt(£t# t ) 
Ht 
St 

Vt 

angular variables 
&v(ji, h) 
Ai?(ji,J 2 ) 
A</>(£t, hh) 

N>& ji) 

A4>(pT, f T ) 
mmA(j){^T, I or r) 
maxA0(^T, I or r) 
minA0(#T,ji or j 2 ) 
maxA0($T,ji or j'2) 
cos 

mass variables 
M(jl j2) 
M(tt) 

Mww 
M(£rjlj2) 

M T ($T,e) [erjj only] 
M t ($t,t) 
M T {$T,er) 
irrinMr(#r, I or r) 
other variables 

jVsoln (tt) 
N soln {WW) 

$t/Ht 
A(lpT, I/It) 
S 

NN r [iirjj only] 

electron identification, £ [erjj only] 
M H 



primary vertex. The variable A(fx, fir) is the ratio of 
the difference and the sum of J^t and Ifix- Variables 
A</)(o, b) are the difference in azimuthal angle between 
object a and the object, or pair of objects, b, and simi- 
larly for the A77 and AR variables. The variable 9 is the 
angle between the dijet system and the proton beam di- 
rection in the laboratory frame. Variables M(a b...c) are 
invariant masses of objects a, 6, ...c and Mt($t> a>) is the 
transverse mass computed from M T — 2E^ IpT ( 1 — cos 
where 4> is the azimuthal angle between TpT and the ob- 
ject, or pair of objects, a. 

The M(tt) and M(WW) variables are determined us- 
ing the Missing Mass Calculator (MMC) method [13] ■ 
These masses are under-constrained owing to the neu- 
trinos from t or W decay. We compute the most likely 
tt mass by scanning over a grid in the 3-dimensional 
space of the azimuthal angle separations of the visible 
r decay products and the neutrino(s) for each r, and 
the invariant mass of the multiple neutrinos from one of 
the t's (e.g. r — > tvV), given the constraints from the 
measured momenta of the visible decay products and the 
known t mass. At each grid point, the calculated tt 
mass is weighted by the probability for finding the AR 
at that point between the visible particles and the neu- 
trinos. For H — > WW in the mass region considered, one 
W is virtual. The MMC mass is calculated for the WW 
system taking the mass of the virtual W to be the most 
likely value (38 GeV) for M H = 125 GeV. The variables 
Nsain (tt) and N so i n (WW) are the number of physical 
mass solutions found in the grid search. Figure [5] shows 
the data and background distributions of representative 
BDT input variables. 

One additional variable, the mass of the hypothesized 
Higgs boson, Mjj, is used in training the BDTs as dis- 
cussed below in Section IIXC1 



B. Separation of T and W subsamples 

We perform a separation of the data, SM MC back- 
grounds and MJ background into two subsamples: one 



10 




1.5 2 2.5 3 3.5 4 4.5 



1 ■ 1 i ■ 1 1 

- data / b j 
Z+jets v ' 
| W+jets 



ill " 
-i-i- ■ multijets 

^■H H diboson 

T Li — signal (x 100)" 

Tfe + 



100 150 
Mjj [GeV] 



180 



200 250 



1 1 1 1 i 1 1 1 1 i 1 1 1 1 i 1 1 1 1 i 1 1 1 1 i 1 1 1 1 . 

D0L=9.7fb 1 ^^ a ! a (C) j 

■ Z+jets 

■ W+jets - 

■ tf 

■ multijets 

■ diboson 
— signal (x 100) : 




100 150 200 
1^ [GeV] 



250 300 



100- 



D0 L=9.7 fb 




data / d ) 
Z+jets v ' 
W+jets 
tt 

■ multijets 

■ diboson 

— signal (x 100) 



80 

70 

^ 60 
O 
o 50 



(A 



40 



a 30 
m 20 



10 



p T [G°eV] 



120 



D0 L=9.7 fbl 



(e) 



+4 



■ ■ ■ i 

- data 

Z+jets 
I W+jets 

I ft 

I multijets 
I diboson 
— signal (x 100). 




50 



100 150 
NW fc,er) [GeV] 



350 
300 

250 H- 



D0 L=9.7 fb 1 



"T" 



t: 



(f) 



■ diboson " 

— signal (x 100) j ■g^UUt" 

i H 

J 100E- 

ik - - 50 b^ 

J ^ 1 1 1 * '1 n> 

50 200 250 V 10 



• data 
Z+jets 

■ W+jets — 

■ tt 

■ multijets - 

■ diboson 

— signal (x 100) ~ 



20 30 
Ns„ ln (TT) 



40 



50 



FIG. 2: (color online) Input variables for the firjj BDT training: (a) Ar](ji, 32), (b) M(jl j2), (c) M(rr), and for the erjj 
BDT training: (d) pr(ji), (e) Mt($t,It), (f) N^tt). 



constructed to be rich in H —> tt signals (the T sub- 
sample), and another rich in H — > WW signals (the W 
subsample) . We perform this separation into orthogonal 
data sets with a BDT (BDTtw) based on the variables 
listed in Table IIIII using a TMVA training between the 
H — y tt and H —> WW MC signal events. Each subsam- 
ple subsequently undergoes its own multivariate analysis 
and the results are combined at the limit-setting stage. 
The subsample separation gives about 15 - 20% improve- 
ment in final Higgs boson limits over the no-separation 
case. It also provides an analysis for a purified H — > tt 
signal, thus giving additional information on fermionic 
decays of the Higgs boson. We employ the BDTtw only 
to define the T and W subsamples; no further use of this 
variable is made. The BDTtw distributions for /iTjj and 
erjj are shown in Fig. [3J 

We impose a cut to separate the T and W subsamples 
at BDTtw = +0.3 for the fxrjj analysis and at —0.6 
for the erjj analysis. The purity of tt decays in the T 
subsample and of WW decays in the W subsample are 
about 90% in the regions where the respective signals 
dominate. The dominant backgrounds in the T subsam- 
ple are Z + jets and (for the nrjj analysis only) MJ. For 
the W subsample, the dominant backgrounds are MJ, tt, 
W + jets and (for the erjj analysis only) Z(ee)+jets. 



C. Global BDTs 

In searches for the Higgs boson, separate multivariate 
analyses have generally been performed for each Higgs 



boson mass under consideration. Due to variations in 
the details in the BDT training, this can lead to fluc- 
tuations in limits from one mass point to another. We 
have constructed a method that reduces such unwanted 
fluctuations. 

We first perform a single training using the Higgs bo- 
son signal MCs at all mass points ("global BDT"). In 
this training, each signal event is characterized by the 
set of variables given in Table IIIII including the value 
of Mh appropriate to each specific signal sample. To 
prevent the classifier from artificially separating back- 
grounds from signals based on the Mh value, we ran- 
domly assign a Mh value to the backgrounds with a Mh 
distribution constructed to reproduce that of the signal 
samples. This assignment is done separately for the T 
and W subsamples and takes into account that there is 
an admixture of H — > tt and H — > WW decays in both 
cases. The Mh variable does not play a strong role in 
separating signals from backgrounds in the global BDT 
training. 

The global BDT is characterized by a set of training 
weights for the splitting of signal and backgrounds at 
each of the nodes of the tree. We then form the final dis- 
criminant at each Mh by passing the MC signal events 
for a particular Mh through the weights provided by the 
global BDT, now with Mh set to the value under consid- 
eration. In this pass, the backgrounds are also provided 
with this specific Mh value, and the data events to which 
the distributions are to be compared are similarly pro- 
vided with the test Mh value. The global BDT approach 
removes the variation in the training at different Mh val- 
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FIG. 3: (color online) BDTtw outputs from the training of 
H — !> tt vs. H — > WW for (a) the fJ.Tjj analysis and (b) the 
erjj analysis. The solid (dashed) lines show the sum of all 
production processes with H — > tt (H — > WW) decays at 
Mh = 125 GeV. Events near BDTtw = — 1 are dominantly 
H — !> WW and those near +1 are mainly H — >• tt. 



FIG. 4: (color online) Final discriminant distributions for the 
/j,rjj analysis for (a) the T subsample at Mh = 125 GeV and 
(b) the W subsample at Mh = 145 GeV. 



X. SYSTEMATIC UNCERTAINTIES 



ues and provides a more uniform distribution of Higgs 
boson cross section limits with minimal (~ 10%) deterio- 
ration in limits relative to separate training at each mass 
point. 



D. Choice of BDT binning 

If the subsample BDT distributions have bins with a 
small number of background events, there can be sta- 
tistical fluctuations in the calculated limits. These are 
reduced by choosing BDT bin sizes that ensure that all 
bins have at least 20 background events before applica- 
tion of event weights. 

Representative final discriminants are shown for the 
T subsample (at M H = 125 GeV) and the W sample 
(at M H = 145 GeV) in Figs. H and [S] for the firjj and 
erjj analyses respectively. The agreement of predicted 
backgrounds and the observed data is good. 



A large number of systematic uncertainties have been 
considered, often broken down separately by analysis 
channel and subsample, tau type, or background or sig- 
nal process. The luminosity and trigger uncertainties 
are obtained from separate analyses of DO data. The 
lepton and tau identification uncertainties are obtained 
from special samples enriched in Z boson decays. The jet 
energy scale, energy resolution, identification and vertex 
confirmation uncertainties are obtained from special dijet 
and 7+jet samples separately for the T and W subsam- 
ples. Uncertainties in the SM background cross section 
normalizations and shapes are obtained using theoretical 
uncertainties, and the extent to which special data sam- 
ples enriched in each background process agree with MC 
predictions. The MJ background uncertainties are deter- 
mined by comparing the results using the MJ-enriched 
samples with those obtained using the SS signal sample 
after SM background subtraction. Signal cross section 
uncertainties are obtained from theoretical estimates and 
include the effect of PDF uncertainties. Table [IV] summa- 
rizes the systematic uncertainties on the various sources. 
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FIG. 5: (color online) Final discriminant distributions for the 
erjj analysis for (a) the T subsample at Mr = 125 GeV and 
(b) the W subsample at M H = 145 GeV. 



TABLE IV: The range of systematic uncertainties (in per- 
cent) from different sources. Type N (S) denote normalization 
(shape) uncertainties on the final discriminant. "XS" denotes 
"cross section". The "jets" uncertainties represent the in- 
dependent uncertainties arising from jet vertex confirmation, 
jet identification and efficiency, jet energy resolution and jet 
energy scale which, for a given channel and subsample, are 
similar. 



Source 


Type 


Uncertainty (% 


Luminosity 


N 


6.1 


/i ID/track match/isolation 


N 


2.9 


e ID /isolation 


N 


4.0 


Single (i trigger efficiency 


N 


5 


All trigger/Single /i trigger 


N 


7 


e+Jets trigger efficiency 


N 


2 


r h selection (by type) 


N 


5.5/4.0/6.0 


Th energy scale 


N 


9.8 


Th track efficiency 


N 


1.4 


W/Z+jets XS 


N 


6.0 


tt, single top XS 


N 


7.0 


diboson XS 


N 


6.0 


VH signal XS 


N 


6.2 


VBF signal XS 


N 


4.9 


GF signal XS normalization 


N 


33 


GF signal XS PDF 


N 


29 


jets nrjj T (W) subsample 


S 


2 - 11 (1 - 11) 


jets erjj T (W) subsample 


S 


4 - 20 (2 - 15) 


PDF (signals) 


N 


1.6 


PDF (backgrounds) 


N 


2.0 


MJ normalization 


N 


5.3 


erjj MJ normalization 


N 


5.0 


lirjj MJ shape 


S 


5-10 



For each source, the impact on the final discriminant is 
assessed by changing the appropriate parameters by 1 s.d 
from the nominal values. Some of the uncertainties af- 
fect only the normalization of the final discriminant dis- 
tribution and some modify its shape while keeping the 
normalization fixed. 



XI. CROSS SECTION LIMITS 

The upper limits on the Higgs boson cross section for 
each analysis are obtained from the final discriminants 
for Higgs boson masses between 105 and 150 GeV in 
5 GeV increments obtained with the modified frequen- 
tist method of Ref. 38], using a negative log likelihood 
ratio (LLR) for the background-only and signal-plus- 
background hypotheses as the test statistic. The LLR 
plots, individually for T and W subsamples and for their 
sum, are shown in Fig. [6] 

The impact of systematic uncertainties on the limits 
is reduced by maximizing a "profile" likelihood func- 
tion [39j in which these uncertainties are constrained 
to Gaussian priors to give a best fit to the data. The 



appropriate correlations are retained (for example, the 
VH cross section uncertainty is fully correlated across 
the [iTjj and erjj analyses and the T and W sub- 
samples). The value of the Higgs boson cross section 
is adjusted in each limit calculation until the value of 
CL S reaches 0.05, corresponding to the 95% C.L., where 
CL S = CL s +b/CLb and CL s +b (C£f>) are the probabili- 
ties for the negative LLR value observed in simulated sig- 
nal+background (background) pseudo-experiments to be 
less signal-like than that observed in our data. The ratio 
of the resulting 95% C.L. upper limits to the SM predic- 
tions on production times branching ratio are shown in 
Fig. [3 and in Table El 

In summary we have searched for the SM Higgs bo- 
son in final states involving an electron or muon and a 
hadronically decaying tau, together with at least two jets. 
We set 95% C.L. limits on the ratio of the Higgs boson 
production cross section to that predicted in the SM of 
11.3 times for a Higgs boson mass of 125 GeV, to be 
compared with an expected ratio of 9.0. For a subsample 
enriched in H — > tt decays we observe (expect) a ratio 
of 12.8 (10.4) for M H = 125 GeV. These are the most 
stringent limits on Higgs boson production with H — > tt 
decay at the Tevatron to date. For an orthogonal sub- 
sample enriched in H — > WW decays the corresponding 
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FIG. 6: (color online) LLR distributions for (a) the T subsample, (b) the W subsample, and (c) the sum of the combined 
T and W subsamples. The firjj and erjj analyses are combined. The black dashed line shows the expected LLR for the 
background-only hypothesis and the red dash-dotted line shows the expected LLR for the signal-plus-background hypothesis. 
The solid black line indicates the observed LLR. The green (yellow) shaded bands indicate the ±1 s.d. (±2 s.d.) uncertainties 
on the expected background only LLR. 
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FIG. 7: (color online) The ratio of 95% C.L. upper limits on Higgs boson production for the (a) T subsample, (b) W subsample, 
and (c) the sum of the T and W subsamples. The firjj and erjj analyses are combined. The green (yellow) shaded bands 
indicate the ±1 s.d. (±2 s.d.) uncertainties on the expected limit ratios. 



ratios are 14.7 (11.5) at M H = 145 GeV. 
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